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We consider the optimal scaling problem for high-dimensional 
random walk Metropolis (RWM) algorithms where the target distri- 
bution has a discontinuous probability density function. Almost all 
previous analysis has focused upon continuous target densities. The 
main result is a weak convergence result as the dimensionality d of 
the target densities converges to oo. In particular, when the proposal 
variance is scaled by the sequence of stochastic processes formed 
by the first component of each Markov chain converges to an appro- 
priate Langevin diffusion process. Therefore optimizing the efficiency 
of the RWM algorithm is equivalent to maximizing the speed of the 
limiting diffusion. This leads to an asymptotic optimal acceptance 
rate of (=0.1353) under quite general conditions. The results 
have major practical implications for the implementation of RWM 
algorithms by highlighting the detrimental effect of choosing RWM 
algorithms over Metropolis-within-Gibbs algorithms. 

1. Introduction. Random walk Metropolis (RWM) algorithms are widely 
used generic Markov chain Monte Carlo (MCMC) algorithms. The ease with 
which RWM algorithms can be constructed has no doubt played a pivotal 
role in their popularity. The efficiency of a RWM algorithm depends funda- 
mentally upon the scaling of the proposal density. Choose the variance of 
the proposal to be too small and the RWM will converge slowly since all its 
increments are small. Conversely, choose the variance of the proposal to be 
too large and too high a proportion of proposed moves will be rejected. Of 
particular interest is how the scaling of the proposal variance depends upon 
the dimensionality of the target distribution. The target distribution is the 
distribution of interest and the MCMC algorithm is constructed such that 
the stationary distribution of the Markov chain is the target distribution. 
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The Introduction is structured as follows. We outline known results for 
continuous independent and identically distributed product densities from 
[14] and subsequent work. We highlight the scope and limitations of the 
results before introducing the discontinuous target densities to be studied 
in this paper. While the statements of the key results (Theorem 2.1) in this 
paper are similar to those given for continuous target densities, the proofs 
are markedly different. A discussion of why a new method of proof is required 
for discontinuous target densities is given. Finally, we give an outline of the 
remainder of the paper. 

The results of this paper have quite general consequences for the im- 
plementation of Metropolis algorithms on discontinuous densities (as are 
commonly applied in many Bayesian Statistics problems), namely: 

(1) Full- (high-) dimensional update rules can be an order of magnitude 
slower than strategies involving smaller dimensional updates. (See Theo- 
rem 3.3 below.) 

(2) For target densities with bounded support. Metropolis algorithms can 
be an order of magnitude slower than algorithms which first transform the 
target support to for some d. 

In [14], a sequence of target densities of the form 

d 

(1-1) 7r,(x'^)=n/(^f) 

1=1 

were considered as (i — )• oo, where /(•) is twice differentiable and satisfies 
certain mild moment conditions; see [14], (Al) and (A2). The following ran- 
dom walk Metropolis algorithm was used to obtain a sample XQ,Xf,... 
from vrd(-). Draw Xq from TTd{-)- For t>0 and i = 1, 2, . . . , let ZtA be in- 
dependent and identically distributed (i.i.d.) according to Z ^ N{0,1) and 
Zf = {Zt,i,Zt^2, Zt4). At time t, propose 

(1.2) Y'^ = X,^ + adZf, 

where ad is the proposal standard deviation to be discussed shortly. Set 
Xf^i = with probability 

(1.3) a(Xf,Y'^) = lA^'^^''^ 



Otherwise set y>4+i ~ -^t • straightforward to check that {X^} has sta- 
tionary distribution '7r^(-), and hence, for all t > 0, X^ ~ '^d{')- The key 
question addressed in [14] was: starting from the stationary distribution, 
how should Gd be chosen to optimize the rate at which the RWM algorithm 
explores the stationary distribution? Since the components of Xj* are i.i.d., it 
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suffices to study the marginal behavior of the first component, Xf-^^. In [14], 
it was shown that if ad = l/Vd {I > 0) and Uf = Xj'^^j -^ (t > 0), then 

(1.4) W^^U asd^cx), 
where U. satisfies the Langevin SDE 

(1.5) dUt = y/h(l)dBt+<t>{l)^^^^dt 

with Uq ~ /(•) and h{l) = 2f'^{-l\/l /2) with $ being the standard normal 
c.d.f. and / = E/[{/'(X)//(X)}2]. Note that the "speed measure" of the 
diffusion (f){l) only depends upon / through /. The diffusion limit for 
is unsurprising in that for a time interval of length s > 0, 0{d) moves are 
made each of size 0{l/^fd). Therefore the movements in the first compo- 
nent (appropriately normalized) converge to those of a Langevin diffusion 
with the "most efficient" asymptotic diffusion having the largest speed mea- 
sure h{l). Since the diffusion limit involves speeding up time by a factor 
of (i, we say that the mixing of the algorithm is 0{d). The optimal value 
oil \s l = 2.38/ VI, which leads to an average optimal acceptance rate (aoar) 
of 0.234. This has major practical implications for practitioners, in that, to 
monitor the (asymptotic) efficiency of the RWM algorithm it is sufficient to 
study the proportion of proposed moves accepted. 

There are three key assumptions made in [14]. First, XQ~7rrf(-), that 
is, the algorithm starts in the stationary distribution and ad is chosen to 
optimize exploration of the stationary distribution. This assumption has 
been made in virtually all subsequent optimal scaling work; see, for exam- 
ple, [3, 7, 10, 11] and [15]. The one exception is [8], where Xq is started from 
the mode of ird{-) with explicit calculations given for a standard multivari- 
ate normal distribution. In [8], it is shown that ad = 0(1/ V^) is optimal for 
maximizing the rate of convergence to the stationary distribution. Since con- 
vergence is shown to occur within 0{logd) iterations, the time taken to ex- 
plore the stationary distribution dominates the time taken to converge to the 
stationary distribution, and thus overall it is optimal to choose ad = l/Vd. It 
is difficult to prove generic results for Xg 7^ iid- However, the findings of [8] 
suggest that even when Xq 7^ tt^, it is best to scale the proposal distribution 
based upon Xq ~ Tr,^. It is worth noting that in [8] it was found that for the 
Metropolis adjusted Langevin algorithm (MALA), the optimal scaling of ad 
for Xq started at the mode of a multivariate normal is 0{d~^^^) compared 
to 0(d-i/6) for X[J~7rrf. 

Second, T^di-) is an i.i.d. product density. This assumption has been re- 
laxed by a number of authors with ad = 0(1/ Vd) and an aoar of 0.234 
still being the case, for example, independent, scaled product densities ([15] 
and [3]), Gibbs random fields [7], exchangeable normals [10] and elliptical 
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densities [17]. Thus the simple rule of thumb of tuning ad such that one 
in four proposed moves are accepted holds quite generally. In [4] and [17], 
examples where the aoar is strictly less than 0.234 are given. These corre- 
spond to different orders of magnitude being appropriate for the scaling of 
the proposed moves in different components. 

Third, the results are asymptotic as d — )■ oo. However, simulations have 
shown that for i.i.d. product densities an acceptance rate of 0.234 is close to 
optimal for d= 10; see, for example, [10]. Departures from the i.i.d. product 
density require larger d for the asymptotic results to be optimal, but d = 
100 is often seen in practical MCMC problems. In [12] and [16], optimal 
acceptance rates are obtained for finite d for some special cases. 

With the exceptions of [11, 12] and [17], in the above works vr^ is as- 
sumed to have a continuous (and suitably differentiable) probability density 
function (p.d.f.). The aim of the current work is to investigate the situation 
where the target distribution has a discontinuous p.d.f., and specifically, tar- 
get distributions confined to the d-dimensional hypercube [0, 1]*^. That is, we 
consider target distributions of the form 

d 

(1-6) 7r,(x'^) = n/(^f), 

i=l 

where 

(1.7) /(x) OCexp(5r(x))l{o<a;<l} (xGM) 
and g{-) is twice differentiable upon [0, 1] with 

(1.8) g* = sup \g{y)\ < oo. 

0<y<i 

We then use the following random walk Metropolis algorithm to obtain 
a sample Xq,X^,... from vrrf(-). Draw Xq from 7rrf(-). For t>0 and i = 
1,2, ... , let Zti be independent and identically distributed (i.i.d.) according 
to Z ~ U[—l, 1] and = {Zti,Zt2, • • . , Zf^)- At time t, propose 

(1.9) Y'' = Xf + adZf. 
Set Xf^i = Y'^ with probability 

(1.10) a(Xf,Y'^) = lA^'^^^'^^ 



Otherwise set Xf^^ = X^. 

In [11] and [17], spherical and elliptical densities are considered which 
have very different geometry to the hypercube restricted densities. Therefore 
different approaches are taken in these papers with results akin to those 
obtained for continuous target densities. Densities of the form (1.7) have 
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previously been studied in [12], where the expected square jumping distance 
(ESJD) has been computed. The ESJD is 



(1.11) E, 



'I'd 



1=1 



the mean squared distance between Xq and Xf , where Xq ~ i^d- In [12], 
Appendix B, it is shown that for cTrf = //d (/ > 0) and f{x) = l{o<x'<i}) 



(1.12) dE 



i=l 



— exp(— -) asd— )-oo. 



Thus asymptoticaUy (as d — t- oo) the ESJD is maximized by taking / = 4 
which corresponds to an aoar of exp(— 2) (=0.1353). In this paper, we show 
that (T(i = l/d and an aoar of exp(— 2) holds more generally for target dis- 
tributions of the form given by (1.6) and (1.7). Moreover, we prove a much 
stronger result than that given in [12], in that, we prove that = Xfgj2T^ i 
converges weakly to an appropriate Langevin diffusion Vg with speed mea- 
sure c^{l) = {P/3) exp(-V(2/*)) as d ^ oo, where /* = lim,^o{(/(a;) + /(I - 
x))/2}. This gives a clear indication of how the Markov chain explores the 
stationary distribution. By contrast the ESJD only gives a measure of av- 
erage behavior and does not take account of the possibility of the Markov 
chain becoming "stuck." If E2d[a(x'^,x'^ + Z"')] is very low, the Markov chain 
started Xq = x'^ is likely to spend a large number of iterations at x'^ before 
accepting a move away from x'^. Note that since involves speeding up 
time by a factor of d^, we say that the mixing of the algorithm is O(d^). 
The ESJD is easy to compute and asymptotically, as d — )■ oo, the ESJD 
(appropriately scaled) converges to Thus in discussing possible exten- 
sions of the Langevin diffusion limit proved in Theorem 2.1 for i.i.d. product 
densities of the form given in (1.6) and (1.7), we make considerable use of 
the ESJD. However, we highlight the limitations of the ESJD in discussing 
extensions of Theorem 2.1. 

In most previous work on optimal scaling, the components of Z'^ are taken 
to be independent and identically distributed Z ~ A^(0, 1) random variables. 
The reason for choosing Z~ [/[— 1,1] for discontinuous target densities is 
mathematical convenience. The results proved in this paper hold with Gaus- 
sian rather than uniform proposal distributions, but some elements of the 
proof are less straightforward. For discussion of the ESJD for densities (1.6) 
for general Z subject to E[Z^] < oo, see [12], Appendix B. 

While the key result, a Langevin diffusion limit for the movement in the 
first component, is the same as [14], the proof is markedly different. Note 
that, for finite d, If^ and are not Markov chains since whether or not 
a proposed move is accepted depends upon all the components in iTdi-)- 
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In [14], it is shown that there exists {Fd} such that IP(Ul=o {^t* ^ ^d}) 
as d — oo and 



(1.13) 



sup 



E[a(x'^,x'^ + cjrfZ'')] -2^> 



where — )■ as d — )• oo. While (1.13) is not exphcitly stated in [14], it is the 
essence of the requirements of the sets {Fd}, stating that for large d, with 
high probability over the first Td iterations the acceptance probability of the 
Markov chain is approximately constant, being within of 2<1>(— Z-v/7/2). 
(Note n rather than d is used for dimensionality in [14].) Thus in the limit 
as d — )• oo the effect of the other components on movements in the first 
component converges to a deterministic acceptance probability 2<I>(— /\/7/2). 
The situation is more complex for 7rd{-) of the form given by (1.6) and (1.7) 
as the acceptance rate in the limit as d — t- oo is inherently stochastic. For 
example, suppose TTdi-) is the uniform distribution on the d-dimensional 



hypercube so that a(X^, Y^) 
(1 - ad, 1), this gives 



1{Y''g[o,i]'^}- Letting = {0,ad) and R'^ 



(1.14, EWx^x^+..z^)i= n (5+^) X n 

Thus the acceptance probability is totally determined by the components at 
the boundary (within ad of or 1). The total number of components in R^ U 
R^ is Bm{d,2l/d) which converges in distribution to Po(2/) as d — t- 00. Thus 
the number of components close to the boundary is inherently stochastic. 
Moreover, the location of the components within R^ U i?^ plays a crucial 
role in the acceptance probability; see (1.14). Therefore there is no hope of 
replicating directly the method of proof applied in [14] and subsequently, 
in [7] and [10]. 

We need a homogenization argument which involves looking at X^^ over 
[d^] steps; cf. [11]. In particular, we show that the acceptance probability 
converges very rapidly to its stationary measure, so that over [d^] iterations 
approximately exp {-lf*/2)[d^] proposed moves are accepted. By compari- 
son, l^j^i] ^ — Xq i\ < [d^]ad; thus the value of an individual component only 

makes small changes over [d^] iterations. That is, we show that there exists 

{Fd} such that, for any T>0, F{[J^^f\xf ^ Fd}) ^ as d ^ cx) and for 
6>0, 



L J2 E[a{Xf,Xl + adZf)\Xt = A-ew(-^-Y-) 



<ed 



(1.15) sup 

x^G-Fd 

for some — t- as d— )• 00. For large d, with high probability over the first 
[Td'^] iterations the Markov chain stays in Fd, where the average number of 
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accepted proposed moves in the following [d^] iterations is exp{—lf*/2)d^ + 
o{d^). The arguments are considerably more involved than in [11], where 
spherically constrained target distributions were studied, due to the very 
different geometry of the hypercube and spherical constraints applied in 
this paper and [11], respectively. In particular, in [11], (Tfi = Ij^fd with an 
aoar of 0.234. 

By exploiting the homogenization argument it is possible to prove that 
converges weakly to an appropriate Langevin diffusion y, given in Theo- 
rem 2.1. In Section 2, Theorem 2.1 is presented along with an outline of 
the proof. Also in Section 2, a description of the pseudo-RWM algorithm is 
given. The pseudo-RWM algorithm plays a key role in the proof of Theo- 
rem 2.1. The pseudo-RWM process moves at each iteration and the moves 
in the pseudo-RWM process are identical to those of the RWM process, con- 
ditioned upon a proposed move in the RWM process being accepted. The 
proof of Theorem 2.1 is long and technical with the details split into three 
key sections which are given in the Appendix; see Section 2 for more details. 
In Section 3, two interesting extensions of Theorem 2.1 are given. In particu- 
lar, Theorem 3.3 has major practical implications for the implementation of 
RWM algorithms by highlighting the detrimental effect of choosing RWM al- 
gorithms over Metropolis-within-Gibbs algorithms. The target densities for 
which theoretical results can be proved are limited, so discussion of possible 
extensions of Theorem 2.1 are given. In particular, we discuss general vr^ 
restricted to the hypercube, general discontinuities in / and Xq ^ iTd- 

2. Pseudo-RWM algorithm and Theorem 2.1. We begin by defining the 
pseudo-random walk Metropolis (pseudo-RWM) process. We will then be in 
position to formally state the main theorem. Theorem 2.1. An outline of the 
proof of Theorem 2.1 is given, with the details, which are long and technical, 
placed in the Appendix. 



For d> 1, let 



2-'^, if z'^e (-1,1)'^, 
0, otherwise. 



Let J(i(x'^) denote the probability of accepting a move in the RWM process 
given the current state of the process is x"'. Then 

(2.1) j,(x^)= /Mz^)|iA "'^^"'+r'n ^-^- 



7rd(x'^) 

Let 6^(x'^) = Yl'j=i ^{xjGR]^}i the total number of components of x'^ in K'^ = 
(0,r/d) U (1 — r/d, 1). By Taylor's theorem for all < Xi,Xi + a^Zi < 1 and 
-l<^i<l, 

(2.2) g{xi + aaZi) - g{xi) > -g*ad 
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with g* defined in (1.8). Hence, for all x"* e [O,!]"^, 

> j /id(z°'){l Aexp(-d5rVd)}l{^d+^^2dg[o,i]d|(iz'^ 
= exp(-/g*) j /irf(z'^)l|^d+^^^dg[o,i]d}(iz'^ 



1\ "d 



(2.3) 



>exp(-/g*)(^- 

This lower bound for Jd(x'^) will be used repeatedly. 

The pseudo-RWM process moves at each iteration, which is the key dif- 
ference to the RWM process. Furthermore, the moves in the pseudo-RWM 
process are identical to those of the RWM process, conditioned upon a move 
in the RWM process being accepted, that is, its jump chain. For d > 1, let 
XQ,Xf, . . . denote the successive states of the pseudo-RWM process, where 
Xq ~ vrd(-). The pseudo-RWM process is a Markov process, where for t > 0, 
X^^^^ = X^^ + aaZf and given that Xf = x'^, Zf has p.d.f. 

C(zV^) = /^d(z^)«(x^x^ + arfZ^)/J,(x'^), z^'g (-1,1)^. 

Note that ((z'^jx'^) = for z"^ ^ (-1, l)"*. Since X|^,X[5 ~ vr^, we can couple 
the two processes to have the same starting value Xq. A continued coupling 
of the two processes is outlined below. Suppose that X^ = x'^. Then for any 
s > 1, 



(2.4) p(^Ui^'+.=^'}l^' = ^'j =(l-'^<^(^'))'- 

That is, the number of iterations the RWM algorithm stays at x^ before 
moving follows a geometric distribution with "success" probability Jrf(x'^). 
Therefore for j > 0, let Mj(-) denote independent geometric random vari- 
ables, where for <p< 1, Mj{p) denotes a geometric random variable with 
"success" probability p. For s G Z+, let = M,(J(Xf )) and for t G Z+, let 

Uf = supjs e : E MjiJdi^j)) < i| , 

where the sum is zero if vacuous. For s G Z"*", attach = Afs( J(X^)) to X^. 
Thus Mg denotes the total number of iterations the RWM process spends 
at Xf before moving to X^^^. Hence, the RWM process can be constructed 
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from {±^,M^),{±f,Mf),... by setting = ±^ and for all s > 1, Xf = 
X^^. Obviously the above process can be reversed by setting X^ equal to 
the tth accepted move in the RWM process. 

For each d> 1, the components of Xq are independent and identically 
distributed. Therefore we focus attention on the first component as this is 
indicative of the behavior of the whole process. For d> 1 and t >0, let 
^/ = ^[.^*],i andy/ = Xf,,,j^^. 

Theorem 2.1. Fix / > 0. For a// d > 1, /ei Xq ~ vr^. Then, as d ^ co , 

in the Skorokhod topology on L'[0,oo), where V. satisfies the (reflected) Langevin 
SDE on [0, 1] 

(2.5) dVt = ^Ml)dBt + Ima'iVt) dt + dL'iiV) - dL]{V) 
with Vq ~ / . Note that Bt is standard Brownian motion, 

</>(0 = -exp^-— J 

and r=iun^^,(m±m^). 

Here {L^,t > 0} denotes the local time ofV aty (=0, 1) and the SDE (2.5) 
corresponds to standard reflection at the boundaries and 1 (see, e.g.. Chap- 
ter VI of [13]). 

Proof. As noted in Section 1, the acceptance probability of the RWM 
process is inherently random and therefore it is necessary to consider the 
behavior of the RWM process averaged over [d^\ iterations, for 5 > 0. Fix 
< 2O7 </3<(5<(5 + 7<^ and let {kd} be a sequence of positive integers 
satisfying [df^] <kd< [d^]. For s G Z+, let Xf = Xfj^.j and for t > 0, let = 
^dy[d%r For ah t > 0, \Xf_,,^, - Xf^,\ < and \[dh] - [d^] x [dH/[d^]]\ < 
[d^]. Hence, for all T > 0, 

(2.6) sup \Vf-Vf\<[d^]ad. 

0<s<T 

Therefore by [5], Theorem 4.1, =^ V as d — )■ 00, if V"^ =^ V as d ^ 00. 
Hence we proceed by showing that 

(2.7) V'^^V asd^oo. 

Let be the (discrete-time) generator of X'^ and let H be an arbitrary 
test function of the first component only. Thus 

(2.8) G^^(x'^) = ^E[/7(Xf) - H{X.'^)\X.^ = 
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The generator G of the (Hmiting) one-dimensional diffusion V for an arbi- 
trary test function H is given by 

(2.9) GH{x) = (t){l){\g {x)H' {x) + \H"{x)] 

for all x G [0, 1] at least for all H £T>, where T> is defined in (2.10) below. 

First note that the diffusion defined by (2.9) is regular; see [9], page 366. 
Therefore by [9], Chapter 8, Corollary 1.2, it is sufficient to restrict attention 
to functions 

(2.10) Hev = {h:he C{[o, 1]) n ^^((o, i)) nv*,Ghe C{[o, i])}, 

where C^((0, 1)) is the set of twice differentiable functions upon (0, 1), C[0, 1] 
is the set of bounded continuous functions upon [0, 1] and P* is obtained by 
setting qi = {i = 0, 1) in [9], page 367, (1.11) and is given by 

(2.11) V* = {h:h'{0) = h'{l) = 0}. 

Let Hf = supo<y<i i^'(y) and = supo<j^<i i^"(y). Then H e C2((0,l)) 
combined with H G T>* implies that < oo. It then follows from g' being 
bounded on [0, 1] and GH £ C{[0, 1]) that H2 < 00. These observations will 
play a key role in Appendix C. 

Now (2.7) is proved using [9], Chapter 4, Corollary 8.7, by showing that 
there exists a sequence of sets {Fd} such that for any T > 0, 

(2.12) P J {X'^^Fd}\^0 asd^oo 
and 

(2.13) sup \G^aH{x.'^) -GH {xi)\^0 asd^oo. 
Let the sets {F^} and {F^} be such that F^ = 0^=1 -^j 

(2.14) Frf=|x'^;P^|Ji^J^^^'^}l^o = x''j <(i-'|, 

where Fj, Fj, F| and F^ are defined below. Recall that b'^{x.'^) = Yl'j=i 1{x-jGR^}) 
the total number of components of x'^ in = {0,r/d) U (1 — r/d, 1). We 
term R^^ the rejection region, in that, for any component in i?^, there is 
positive probability of proposing a move outside the hypercube with such 
moves automatically being rejected. Let 

(2.15) F] = {x'^;6[,(x'^)<7log4, 
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(2.16) Fj= n {x'^H^r (x^)-E[6r (X^)]l<^^}, 

k=[dfi] 

(2.17) F| = |x'^; sup sup |Ad(x'^; r; fc^) - A(r)| < d"^), 

^ [dP\<ka<[d'\0<r<l 



(2.18) Fi = x'^; 



d ■ 1 



where Ad(x'^; r; fc^) = E[6;^(Xf J|X|^ = x'^] and A(r) = /*r(l + r/20. In Ap- 
pendix A, we prove (2.12) for the sets {Ffj^} given in (2.14). Note that (2.12) 
follows immediately from Theorem A. 13, (A. 74) since Xg~7rd. An outline 
of the roles played by each (j = 1,2,3,4) is given below. For x*^ G F^ 
(x*^ G F^) the total number of components in (close to) the rejection region 
are controlled. For x'^ G Fj^ after iterations the total number and position 
of the points {X^^|Xq = x'^} in i?^ are approximately from the stationary 

distribution of X'^. Finally, for x'^ E F^, 3 Ei=i f'C^^i)^ ~ %b'(^)^]; this 
is the key requirement for the sets {F^} given in [14], cf. [14], page 114, 

Rn(,X2 5 • • • , Xfi). 

The proof of (2.13) splits into two parts and exploits the pseudo-RWM 
process. Let 

(2.19) = maxj^ = 0,1, . . . ,[d' - !]■ ±-^J^^ M,{UXf)) < l| /[d% 

the proportion of accepted moves in the first [d^] iterations, where the sum 
is set equal to zero if vacuous. Then Xf = Xj'^ij = X.^p^^^^ and 

(2.20) G^i/(x'^) = ^^E[H{±^p^,s]) - i^(X^)|X^ = x"^]. 

In Appendix B, we show that for all x"' G Fa, Pd\^t = x'^ ^ exp(-//72) 
as d — 7- 00. Consequently, it is useful to introduce G^^^ H{x'^) (0 < vr < 1) 
which is defined for fixed < vr < 1 as 

i/(x'^) = ^E[(//(Xf^,.j) - H{±tmt = A 

(2.21) ^ mi^Ui)-^(^')\^t=A 
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[7rd*-l] 



E[G,i/(X,^)|X^ = x^] 



j=0 



where 



(2.22) 



Finally in Appendix C, we prove in Lemma C.6 that 
(2.23) sup sup \G^/H{x.'^) -GH {xi)\^0 asd^oo. 

0<7r<lxd(=F^ 

The triangle inequality is then utilized to prove (2.13) in Lemma C.6 us- 
ing (2.23) and Pd\^o = x'^ ^ exp(-Z/V2) asd^oo. □ 

It should be noted that in Appendix C, we assume that E[g'(X)^] > 0, 
in particular in Lemma C.l. In Appendixes A and B we make no such as- 
sumption. However, E[(/'(X)^] = corresponds to f{x) = l{o<x<i} (uniform 
distribution), and proving Lemma C.6 in this case follows similar but simpler 
arguments to those given in Appendix C. 

A key difference between the diffusion limits for continuous and discon- 
tinuous i.i.d. product densities is the dependence of the speed measure (p{l) 
upon /. For continuous (suitably differentiable) /, (/>(/) depends upon / = 
¥,f[{f'{X)/f[X)}'^], which is a measure of the "roughness" of /. For discon- 
tinuous densities of the form (1.7), (p{l) depends upon /* = lim^|o{(/(3^) + 
/(I — x))/2}, the (mean of the) limit of the density at the boundaries (dis- 
continuities). Discussion of the role of the density / in the behavior of the 
RWM algorithm is given in Section 3. 

The most important consequence of Theorem 2.1 is the following result. 

Corollary 2.2. Let a{l) = exp(-/*//2). Then 

E^^E[Jrf(X[f)] ^ a(/) asd^oo. 
is maximized hy 1 = 1 = 4//* with 

a(/) = exp(-2) = 0.1353. 

Clearly, if /(•) is known, / can be calculated explicitly. However, where 
MCMC is used, /(•) will often only be known up to the constant of pro- 
portionality. This is where Corollary 2.2 has major practical implications, in 
that, to maximize the speed of the limiting diffusion, and hence, the efficiency 
of the RWM algorithm, it is sufficient to monitor the average acceptance rate, 
and to choose I such that the average acceptance rate is approximately e~^. 
Therefore there is no need to explicitly calculate or estimate the constant of 
proportionality. 
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3. Extensions. In this section, we discuss the extent to which the con- 
clusions of Theorem 2.1 extend beyond vr^ being an i.i.d. product density 
upon the d-dimensional hypercube and Xq ~ vr^. First we present two exten- 
sions of Theorem 2.1. The second extension, Theorem 3.3, is an important 
practical result concerning lower-dimensional updating schema. 

Suppose that /(•) is nonzero on the positive half-line. That is, 

(3.1) /(x) « exp(5(x)) (x>0) 

and f{x) = otherwise. 

Theorem 3.1. Fix I > 0. For all d>l, let Xq ~ TTd, given by (3.1), with 
snY>x>Q\g' {x)\ = g* < CO . Then, asd^oo, 

in the Skorokhod topology on D[0, oo), where V. satisfies the (reflected) Lan- 
gevin SDE on [0,oo) 

dVt = y^dBt + lcl){l)g'{Vt) dt + dL'iiV) 

with Vo ~ /, (l){l) = !j exp{-f*l/4) and f* = lim^.^o fix). 

Proof. The proof of the theorem is virtually identical to the proof of 
Theorem 2.1, and so, the details are omitted. □ 

Note that we have assumed that g'{-) is bounded on [0,oo). This as- 
sumption is almost certainly stronger than necessary with g'{-) being Lips- 
chitz and/or satisfying certain moment conditions probably being sufficient; 
cf. [14]. 

Theorem 3.1 is unsurprising with the speed of the diffusion depending 
upon the number of components close to the discontinuity at 0. 

Corollary 3.2. Let 7rrf(x'^) = Y\i=if{xi) where f satisfies (3.1). Then 
E^, [ Jrf(X^)] ^ exp(-r//4) = a{l) asd^oo. 
(f)[l) is maximized by 1 = 1 = 8/f* with 

a(/) = exp(-2) = 0.1353. 

Therefore the conclusions are identical to Corollary 2.2 that in order to 
maximize the speed of the limiting diffusion it is sufficient to choose / such 
that the average acceptance rate is e~^. 

The second and more important extension of Theorem 2.1 follows on 
from [10]. In [10], the Metropolis- within-Gibbs algorithm was considered. 
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where only a proportion c (0 < c < 1) of the components are updated at each 
iteration. For given > 1, at each iteration Cdd of the components are chosen 
uniformly at random and new values for these components are proposed 
using random walk Metropolis with proposal variance c^c^ = i^/d)^- The 
remaining (1 — Cd)d components remain fixed at their current values. Finally, 
it is assumed that q — )• c as d — )• cxd. 

The following result assumes that /(•) is nonzero on (0,1) only. The ex- 
tension to the positive half-line is trivial. 

Theorem 3.3. FixO<c<l andl>0. For all d> I, letX.^ = {X^^^,X^2^ 
. . . be such that all of its components are distributed according to /(•). 

Then, as d — >• oo, 

in the Skorokhod topology, where Vq ~ /(•) and V satisfies the (reflected) 
Langevin SDE on [0, 1] 

dVt = ^/M^dBt + \(t)c{l)g'{Vt) dt + dLO(y) - dL\{V), 

where Bt is standard Brownian motion, (j)c{l) = ^ exp(— c/*//2) and f* = 

iinia;4,0 2 • 

Let a'^{l) denote the average acceptance rate of the RWM algorithm in d 
dimensions where a proportion q of the components are updated at each 
iteration. Let 

a^(/) =exp(-c/*//2). 
We then have the following result which mirrors Corollaries 2.2 and 3.2. 

Corollary 3.4. Let c^^ c as d^ oo. Then 
a''j'{l) ^ a''{l) asd^oo. 
For fixed < c < 1, 0c (0 maximized by 




c 

Also 

a(fc) =exp(-2) = 0.1353. 
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Corollary 3.4 is of fundamental importance from a practical point of view, 
in that it shows that the optimal speed of the limiting diffusion is inversely 
proportional to c. Therefore the optimal action is to choose c as close to 
as possible. Furthermore, we have shown that not only is full-dimensional 
RWM bad for discontinuous target densities but it is the worst algorithm of 
all the Metropolis-within-Gibbs RWM algorithms. 

We now go beyond i.i.d. product densities with a discontinuity at the 
boundary and Xq ~ t^^. We consider general densities on the unit hypercube, 
discontinuities not at the boundary and Xq 9^ vr^. As mentioned in Section 1, 
for i.i.d. product densities, the speed measure of the limiting one-dimensional 
diffusion, (t){l), is equal to the limit, as d — )• 00, of the ESJD times d. Therefore 
we consider the ESJD for the above-mentioned extensions as being indicative 
of the behavior of the limiting Langevin diffusion. We also highlight an extra 
criterion which is likely to be required in moving from an ESJD to a Langevin 
diffusion limit. 

Using the proof of Theorem 2.1, it is straightforward to show that 

f (If* 
<^(/) = -exp(^- — 

(3-2) = l^nzl] lim 



— hm E 

3 d— >oo 



The first equality in (3.2) can be proved using Lemma A. 6, (A. 26), where for 
Zi U (—1, 1), IE[^i] = 1/3. The second equality in (3.2) comes from the fact 
that for < X < ad, f{x) + f{l — x) = 2f* + 0{l/d) and for a component Xq^ 

uniformly distributed on {0,l/d) or {l-l/d,l), P(X^ • + e [0, 1]) = 3/4. 
That is, the acceptance probability of a proposed move is dominated by 
whether or not the proposed move lies inside the d-dimensional unit hyper- 
cube. Proposed moves inside the hypercube are accepted with probability 
1 — o{d~°^) for any a < 1/2; see Lemma A. 7. Thus it is the number and 
behavior of the components at the boundary of the hypercube (the discon- 
tinuity) which determine the behavior of the RWM algorithm. This is also 
seen in Theorems 3.1 and 3.3. 

First, we consider discontinuities not at the boundary. Suppose that 
7rd(x'^) = Hj'Li f(xi), where 

(3.3) f{x) oc l{xe[a,b]} exp(s'(x)) (x G M) 

for some a,6G M. Further suppose that g{-) is continuous (twice differen- 
tiable) upon [a, b] except at a countable number of points, V = {ai, 02, . . . , ak}, 
say, on (a, b). Set oq = a and flfc+i = b, with = l/d. For j = 0, 1, . . . , fc + 1, 
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let fj = lima:^aj~ fix) and /+ = lim^ 
Y+ ~ Po(Z/+/4), where = = 
(4.23), we can show that d times the ESJD 



f{x), with yr ~ Po(//r/4) and 
0. Then fohowing [12], 



^k+l 



(3.4) dE 



X, 



d \2 
OA) 



-E 
3 



k+l 



j=0 



as d - 



■ oo. 



Thus the optimal scaling of is again of the form I / d and the acceptance or 
rejection of a proposed move is determined by the components close to the 
discontinuities. Furthermore, it is straightforward to show that for each j = 

0, 1, . . . , fe + 1, ^'^ifj'/fj')^^ as / — oo, implying that the optimal 

choice of I lies in (0,oo). Proving a Langevin diffusion for the (normalized) 
first component of the RWM algorithm should be possible with appropriate 
local time terms at the discontinuities in /. While (3.4) holds regardless of fj' 

and fj~ for a diffusion limit we require that mini<j<fc+i /~, mino<j<A; fj' > 0, 
that is, the density is strictly positive on {a,b). (If this is not the case, the 
RWM algorithm is reducible in the limit as d — )• oo.) Extensions to the case 
where either a = — oo and/or b = oo are straightforward. 

Second, we consider general densities which are zero outside the d-dimen- 
sional hypercube, 7rrf(x'^) oc l|x<ie[o,i]d} exp(/i(i(x^)), where Hd{') is assumed 
to be continuous and twice differentiable. Let ad = l/d and assuming that 

(3.5) exp(//rf(X^ + aaZf) - /.^(X^)) 

we have that d times the ESJD satisfies 



1 



as d ■ 



■ oo, 



(3.6) dE 



■ d 

E 

,i=l 



^ - hm E 






3 d— >oo 





as d - 



■ oo. 



Note that (3.5) is a weak condition and should be straightforward to check 

using a Taylor series expansion of fid- For i.i.d. product densities, 6^(Xq) 
Po(2Z/*) as (i— 7- oo. More generally, the limiting distribution of 6^(Xg) will 
determine the limit of the right-hand side of (3.6). In particular, so long 
as there exist 5 >0 and K G N such that P(limrf^oo b^(Xo) < K) > 5, the 
right-hand side of (3.6) will be nonzero for I > 0. It is informative to con- 
sider what conditions upon vr^ are likely to be necessary for a diffusion 
limit, whether it be one-dimensional or infinite-dimensional as in [7]. Sup- 
pose that 6|^(Xq) B as d— >■ oo. For a diffusion limit we will require mo- 
ment conditions on B, probably requiring that there exists e > such that 
E[exp(ei?)] < oo. This will be required to control the probability of the RWM 
algorithm getting "stuck" in the corners of the hypercube. This highlights 
a key difference between studying the ESJD and a diffusion limit. For the 
ESJD, we want a positive probability that the total number of components 
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at the boundary of the hypercube is finite in the hmit as (i — )• oo. For the dif- 
fusion hmit, as seen with the construction of {F^} in Theorem 2.1, we want 
that the probabihty of there being a large number of components (0(log(i)) 
at the boundary is very small (o((i~^)). 

Third, suppose that Xq 9^ vr^. There are very bad starting points in the 
"corners" of the hypercube. For example, if Xq = (exp(— d), exp(— d), . . . , 
exp(— d)), Jd(XQ) (0.5 + exp(— d))"' which even for d = 100 is less than 
1 X 10""^*^. Thus the RWM process is likely to be "stuck" at its starting 
point for a very long period of time. This is rather pathological and a more 
interesting question is the situation when Xq = S^, where the components 
of S"^ are i.i.d. In particular, suppose that Sf ~ C/[0, 1], so that Xg is chosen 
uniformly at random over the hypercube. Note that, if S'^ is the uniform 
distribution, 



(3.7) dE 



E( vd yd \2\-v-d c(i 
~ ^0,i) 1^0 — 



— > — exp ( ~ 2 1 as d — )■ 00 



with the right-hand side maximized by taking I = 4 compared with I = 4//* 
for Xq ~ TTd- We expect to see similar behavior to [8], in that the optimal ad 
(in terms of the ESJD) will vary as the algorithm converges to the station- 
ary distribution but will be of the form ad = l/d throughout. The RWM 
algorithm is unlikely to get "stuck" with it conjectured that for any T > 
and 7 > 0, 

P( U {b^-X'i) > 7logd}|X^ ^ S-^j ^ as d ^ 00. 

Simulations with f{x) oc l{o<a;<i} exp(— 2x) and f{x) oc l{o<a;<i} exp(— (x — 
0.5)^/2) and d= 10, 20, . . . , 200 suggest that convergence occurs in O(d^) 
iteration. For convergence, we monitor the mean of Xf for /(x) oc 
l{o<a:<i} exp(— 2x) and the variance of Xf for f{x) oc 1{q<^.<i} exp( — (x — 
0.5)2/2). 

APPENDIX A: CONSTRUCTION OF THE SETS {Fd} AND {Fd} 

The sets Fd consist of the intersection of four sets F^ (i = 1,2,3,4). For 
i = l,2, 3, 4, we will define F^ and discuss the role that it plays in the proof 
of Theorem 2.1, one at a time. Furthermore, we show that in stationarity 
it is highly unlikely that Xf does not belong to Fd- Since we rely upon 
a homogenization argument, it is necessary to go further than the sets Fd 
to the sets Fd C Fd- In particular, if Xq G Fd, then it is highly unlikely that 
any of X^,X2, . . . 5X|^^4j do not belong to Fd- The above statement is made 

precise in Theorem A. 13 below, where the constructions of {Fd} and {Fd} 
are drawn together. 
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It is possible that all d components of Xq are in i?^. However, this is highly 
unlikely and we show in Lemma A.l that with high probability, there are 
at most 7logd components in the rejection region. Let Fj = {x'^;6^(x'^) < 
7 log 4. 

Lemma A.l. For any k>Q, 

d^¥{y4 i Fd) ^0 asd^oa. 

Proof. Fix k > 0. Note that ^ F] if and only if 6^(X[J) > 7logd. 
However, 

/ Ajd 

6^(X^)~Bin(^d,y^ {f(x)^f{\-x))dx 
with 

Aid 

(A.l) d\ {f{x) + f{l-x)}dx^2f*l asfi^oo. 

Jo 

Fix p > K/7. By Markov's inequality and using independence of the com- 
ponents of Xq, 

d'^P(6^(X^)>7logd) 

< d'^E[exp(p6^(X^))]/exp(p7logd) 

(A.2) =d^E[eMPhxl,eR',})f/d'^ 

l/d \ d 

{f{x) + f{l-x)}dx] /df^ 



= d''(^l + {eP-l) 
<d'^-^Texp(^(e^-l)d^ 



l/d 

{/(x) + /(l-x)}dx). 



The lemma follows since (A.l) implies that the right-hand side of (A.2) 
converges to as (i — )■ 00. □ 

For x'^ G F], it foUows from (2.3) that 

(A.3) Jd(x^) > exp(-/5*)2-^^^(^') > exp{-lg*)2-^^"^'^ > exp{-lg*)d-'^ . 

This is a useful lower bound for the acceptance probability and as a result 
the random walk Metropolis algorithm does not get "stuck" at values of 
x'^ G FJ. To assist with the homogenizing arguments, we define {F^} by 

(A.4) Fi = |x'^;p(^|Jx^^^F]|X^ = x'^^ <d-3|. 

That is, by starting in Fj it is highly unlikely that the pseudo-RWM algo- 
rithm leaves Fj in [d^] iterations. To study Fj and later F^ we require the 
following lemmas. 
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Lemma A. 2. For a random variable X, suppose that there exist (5, e > 
such that 

(A.5) ¥{XeA\XeB)<5e 

and for all x G , ¥{X G A\X = x)>e. Then 

(A.6) ¥{X ^D\X ^B)<5. 

Proof. First note that 

P(X G A\X £B)> F{X £Ar\X£ D^\X G B) 

(A.7) 

= P(X G A\X eD^,X£ B)F{X ^ D\X G B). 

The lemma fohows from rearranging (A.7) and using (A.5) and ¥{X G A\X G 
D'^,X € B)>£. □ 

Lemma A. 3. Suppose that a sequence of sets {F^} is such that there 
exists K> such that 

(A.8) d''F{X.^(^F^)^0 asd^oo. 

Fix £ > and let 

(A.9) F,*=|x'^;p(^UiXf^FlnF]}|X^ = x'^j <d-|. 

Then 

(A.IO) d^-(2+5+7+^)p(X[J ^ F^) ^0 asd^oo. 

Proof. Since Xf ~ tt^, 

(A.ll) Fi(j {Xf^F^nFilj <d2+^+^P(X^^F,*nFi). 
Therefore for all sufficiently large d, 

(A.12) P(X^ G F^ n Fi) > 1 - P(X^ ^ FJ) - P(X^ ^ F]) > i. 

By Bayes's theorem, ¥{A\B) = F{AnB)/F{B) < F{A)/F{B). Therefore tak- 
ing A = u!£o '^^'{Xf ^ F* n F]} and S = {XjJ G F* n F]}, it follows from 
(A.ll) and (A.12) that 

'f"''^'^^' ^ d2+'5+TP(XJ^ ^ F^^ n F]) 



U {xf ^ F,^ n Fi}|X^ G F^ n Fij < 



1/2 



20 P. NEAL, G. ROBERTS AND W. K. YUEN 

Let 

(A.14) Fl = |x'^;Pf U {Xf^FlnFi}|X^ = x'^J <d-|. 
It follows from Lemmas A.l and A. 2 that 

(A.15) (i''-(2+^+^+^¥(X;^ i F^\X.f, G n F}) ^0 as d ^ oo. 

Since d'"P(X;^ ^ n F}) ^ as d ^ oo, it follows from (A.15) that 

(A.16) (i''-(2+5+7+^)p(x^ ^ F^) ^0 as d ^ oo. 

For d> 1 and i = 0,1,2,..., let {Of} be independent and identically 
distributed Bernoulli random variables with P(0q = 1) = exp(— /(/*)2~'''^°^'^ 
where g* = max|o<x<i}|5'(2;)|- It is straightforward using Hoeffding's in- 
equality to show that 

(A.17) d'^Pl 0f<dM^O asd^oo. 

Now {9'^^} and {X^} can be constructed upon a common probability space 
such that if 9j = 1 and X^ G F], X^_^^ / X^. For A;,n > 0, consider Xf , if 
Y17=i^i ^ ^ and n"=o{Xj ^ ^d* -^il' a coupling exists such that there 
exists Jk£{k,k + l,...,n} such that Xf = Xj^ € F^CiF}. Exploiting the 

above coupling, C]f=o'^''\'^j ^ n F]} and EUT'^^^ > together im- 
ply that rffoi^j e Fd n i^]}- Thus 



(A.18) P(X^^Fl)<P(X^^Fi)+P Yl ^f<d' 

\ i=l I 

and (A. 10) follows from (A.16), (A.17) and (A.18). □ 

As noted in Section 2, we follow [11] by considering the behavior of the 
random walk Metropolis algorithm over steps of size \S\ iterations. We find 
that a single component moves only a small distance in \d^\ iterations, while 
over \d^\ iterations the acceptance probability, which is dominated by the 
number and position of components in i?^, "forgets" its starting value. More- 
over, we show that approximately exp (-/*//2)[d^] of the proposed moves 
are accepted. However, we need to control the number of components which 
are dost to the rejection region (Fj) and the distribution of the position 
of the components in the rejection region after [d^] iterations (F|), where 
0</3<(5. 
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For any k>l, let 

F!{k) = {x'^ : \bf\^') - E[bf\jLt)]\ < ^} 

and let 

(A. 19) F|= n F!ik). 

Before studying i^J, we state a simple, useful result concerning the central 
moments of a sequence of binomial random variables. 

Lemma A. 4. Let ~ Bin((i,p^). Suppose that p^^O and dpd — t- oo as 
ci — )• oo; then for any m G N, 

m 

(A.20) ¥.[{Bd-nBd]mdpdT^J{{2j-l) asd^oo. 

Lemma A. 5. For any k> and sequence of positive integers {kd} sat- 
isfying [d^]<kd<[d^], 

(A.21) d''¥{X.'^(^Fj{kd))^0 asd^oo. 

Consequently, for any k>0, d'^F{X.Q ^ FJ) — > as d^ oo. 



Proof. Fix k> 0. By stationarity and Markov's inequality, for all m G N, 

,3/4 ,3/4 

d^nxUFi{kd))=d^n\b'/ {:^t)-nb'/ (x^)]|>y^) 



(A.22 



Jft ,3/4 ,3/4 

<^E[{b''/ {^i)-E[b''/ (X^)])'n 



^3/4 h''l^ld 

However, 6/ (X[j) ~ Bin((i, /q {/(x) + /(I - x)} dx), so by Lemma A.4 
for any m G N, for all sufficiently large d, 

E[(6f V^) - IE[# V^)])"T < K^^T"^ 

where /i'm = njLi(2i - 1) + 1- Since k^ > K], the right-hand side of (A.22) 
converges to as d — )■ oo by taking m > ^k/ f3, proving (A.21). 
Note that 

(A.23) d^FiXt i Fl) <d^Y. IP(^o i F!{k)). 

The right-hand side of (A.23) converges to as d — ?• oo since (A.21) holds 
with K replaced by «; + 5. □ 
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Before considering Fj^, the distribution of the position of the components 
in the rejection region after [d^] iterations, we introduce a simple random 
walk on the hypercube (RWH). The biggest problem in analyzing the RWM 
or pseudo-RWM algorithm is the dependence between the components. How- 
ever, the dependence is weak and whether or not a proposed move is accepted 
is dominated by whether or not the proposed moves lies inside or outside the 
hypercube. Therefore we couple the RWM algorithm to the simpler RWH 
algorithm. 

For d>l, define the RWH algorithm as follows. Let denote the po- 
sition of the RWH algorithm after k iterations. Then 

^ ' ' \wf, otherwise. 

That is, the RWH algorithm simply accepts all proposed moves which re- 
main inside the hypercube and rejects all proposed moves outside the hy- 
percube. Define the pseudo-RWH algorithm in the obvious fashion with 
— (^fc 1' ^fc 2' • • • ' ^fe d) denoting the position of the pseudo-RWH al- 
gorithm at iteration k. Then for 1 < i < d, W^_^i j = + (TdZf_^^ ^, where 

For our purposes it will suffice to consider the coupling of the pseudo- 
RWM and pseudo-RWH algorithms over [d^] iterations and study how the 
pseudo-RWH algorithm evolves over [d^] iterations. Note that the RWH 
algorithm coincides with the RWM algorithm with a uniform target density 
over the d-dimensional cube, so in this case the coupling is exact. 

The components of the pseudo-RWH algorithm behave independently. For 
X e (0, 1), let uJdix) =F{0<x + (JdZ < 1) and for x'' G (0, 1)'', let Orf(x'^) = 
'lYj^i^dixj)- Then f]^(x'^) is the probability that a proposed move from x*^ 
is accepted in the RWH algorithm. 

Lemma A. 6. For any a < ^ and x*^ E [0, 1]*^, there exists a coupling such 
that 

(A.25) d"P(Xf / Wf|X[J = W[J = x'^) ^0 asd^oo. 



Proof. Let U ~ f7[0, 1]; then we can couple Xf and Wf using and U 
as follows. Let 

x'^ + ddZf, if x'^ + ddZf G [0,1]^ 
x'', otherwise, 

f^'^ + ad'Li if x'^ + cjrfZf G [0, 1]'^ 



X? 



and [/ < 1 A exp {^2^g{xj + adZij) - g{xj)}j 



x'', otherwise. 
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Therefore, Xf / Wf if x"^ + cj^Zf G [0, l]'^ and C/ > 1 A eMJ2j=i{9ixj + 
o-rf^ij) -g{xj)}). Thus 

= d"P^x'' + (7Xe [O,!]"', 

[/ > 1 Aexp +crdZij) - g{xj)}^^ 



(A.26) 



{0<2:j+crdZij<l} 



1 - 1 A exp ( ^{5(2;^ + CTdZij) - gixj)} 
Vi=i / 



<d"E 



^{g{xj + cJdZij) - g{xj)} 
i=i 

since for all y G M, |1 — {1 A exp(y)}| < \y\. 

By Taylor's theorem, for I < j < d, there exists S,j lying between and 
(TrfZij such that 



(A.27) g{xj + adZij) - g{xj) = g'{xj)adZij + 



Since g{-) is continuously twice differentiable on (0, 1), there exists K < 00 
such that 



(A.28) 



^{g(,xj + (TdZij) - g{xj)} 



< 



+ 



2d 



Since the components of are independent, by Jensen's inequality, 
(A.28) and E[{X + cf] < 2E[X^] + 2c^, for any random variable X and 
constant c, we have that 

d"p(xf / wf |x;J = w;^ = x'^) 

(A.29) <^"(^2|^E^^'(^^)' 

— as d — 00, 
and the lemma is proved. □ 



4d2 
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Corollary A. 7. FixO<a<^-d. 

For any x*^ G [0, 1]"^, there exists a coupling such that 

(A.30) d"P |J{XjVw^^}|X^ = W^ = x^^O asd^oo. 

Moreover, if x'^ G F^ and a + 5 + ^ < ^ , there exists a coupling such that 
Ad'] \ 

(A.31) d"phj{X^"'/W^^}|X[5 = WEj = x^^0 asd^oo. 

For r>0 and fc = 0,l,2,..., let 

(A.32) x1ixf,r; k) = i^^ Hi ^ ^d given that W^q", = 

[0, otherwise. 

Let q'^{x;r;k)=K[xf{x;r;k)] and let Xd{'x.'^]r;k) = Y^j^-^^q'^{xj;r;k). Note 
that the movement of the components of the pseudo-RWH algorithm are 
independent. 

The next stage in the proof is to show that, if Xq is started in Fj^, then 
after k^ iterations of the pseudo-RWM algorithm has forgotten its starting 
value in terms of the total number and position of the components in i?^ 
(the rejection region). Moreover, the total number and position of the com- 
ponents in i?^ after k^ iterations of the pseudo-RWM algorithm are approxi- 
mately from the stationary distribution of X^^. Before defining and studying 
{F^}, we require the following lemma and associated corollary concerning 
the distribution of the components in the rejection region after k^ steps. 

Lemma A. 8. Let {k^} be any sequence of positive integers satisfying 
[d^]<kd<[d% 

For any sequence of {x'^} such that x'^ G i^J n Fj, 

d 

(A.33) d2T^g'^(xf;/;fcrf)2^0 asd^oo. 

i=l 

Also for all < x <1, 

(A.34) d'^^q'^{x;l;kd) ^0 asd^oo. 

Proof. Fix x'^ G F] n F| and set = x'^. 

To prove (A.33) and (A.34) we couple the components of to a simple 
reflected random walk process {Sf; t > 0}. Set Sq = x for some < x < 1. Let 
Zi, Z2, . . . be i.i.d. according to ?7[— 1, 1]. For i > 1, set Sf_^_^ = Sf + a^Zt+i 
with reflection at the boundaries and 1 so that Sf G (0, 1). For x G (0, 1), 

let p^(x; /, kd) = e ^Sl-^o^ = 
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Consider Wfi with identical arguments applying for the other components 
of . Since kdCTd — )■ as d — )• oo, we assume that d is such that {kd + l)cr(i < 



Then 



2 

(A.35) 



For X G (0, {kd + 1)0"^) U (1 — {kj, + 1) we can couple Sf and H^/^ such 
that 

(A.36) q\x;l;kd)<p\x;l;kd). 

For < y < 1 - fjrf, if 5^^ = Wf^-^ = y, then set Sf^^ = Wf^^^^ = y + aaZt+i . 
Now if y < (jrf (y > 1 — cr^), Z^+i and Zf_^_i i can be coupled such that, if Sf = 
Wf^^ = y, then Sf,^-^^ < Wf^^^^ {Sf_^_^ > W't+i^i). Furthermore, for yi < y2 < 
1/2 (yi > 7/2 > 1/2), the above coupling can be extended to give, if Sf = yi 
and Wf^ = y2, then Sf_^_l < Wf^-^ {Sf+i > W^j+i)- Since in k^ iterations either 
process can move at most a distance k^crd, (A.36) follows from the above 
coupling. 

Without loss of generality, we assume that < x < {kd + l)crd [symmetry 
arguments apply for 1 — (kd + 1)0",^ < x < 1]. By the reflection principle, 



p'^{x;l;kd) = Fi -ad < x + ad^Zi < ad 



(A.37) 



i=l 



X ^ 

-i< — +y^i<i 



By the Berry-Esseen theorem, there exists a positive constant, Ki < oo say, 
such that for all 2: € M, 



(A.38) 



i=0 



< 



where <!>(•) denotes the c.d.f. of a standard normal. Therefore it follows 
from (A.37) and (A.38) that there exists a positive constant, K2 < 00 say, 
such that for all x G (0, 1), 

K2 



(A.39) 



p''{x;l;kd)< 



By Hoeffding's inequality, for any e > 0, 



(A.40) 



i=l 



>ek^J^ < 2exp 



3/4 N 2 



2eM-e'^Jkd/2). 
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Hence for k^J^ /d < x < {kd + by taking e = 1/2/ in (A. 40), we have 

that 



(A.41) 



dp'^{x-l;kd) = dl 



< dF\ 



1=1 



j=i 



> 



3/4 \ 



< 2dexp 



8P 



2d 
>0 



as d — )• oo. 



Furthermore, note that for {kd + 1)0"^ < x < 1 — {kd + 1)0"^, p{x; I; kd) = 0. 

Then (A. 34) fohows immediately from (A. 36) and the above bounds for 
p{x; I; kd) since (^}^ l\/kd — )• as d — )• oo. 

FinaUy, for x'' G F] n FJ, it follows from (A. 36), (A.38) and (A.39) that 
there exists K-^ < oo such that 



d'^ ^ q\xf- 1; kd)' < fi'^ ^/(xf ; kd)' 



(A.42) 



1=1 



i=l 



<d27|^3fc3/4(^^y + 2dexp 



8/2 



with the right-hand side of (A.42) converging to as d — )• oo . □ 

Corollary A. 9. For any m>2, any sequence {r^} satisfying < < / 
and any sequence of positive integers {kd} satisfying [d^] < kd < [d^], there 
exists K <oo, such that for all d>l, 



(A.43) 



E[q''{X^,;rd;kd)n<Kd-^'+'''^/'l 



Proof. Fix m>2. Note that 

E[g^(Xo^,i; r,; A;,)-] < E[q\X^^,-l; kd)^ 

\Hx;hkd)'^f{x)dx 



(A.44) 



3/4 q%x-l-kd)'^f{x)dx 

d 

+ / ^3/4 q\x-,l-kd)'^f{x)dx. 
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The two terms on the right-hand side of (A. 44) are bounded using (A. 39) 
and (A. 41), respectively. Thus it fohows from the proof of Lemma A. 8 that 
there exist constants Ki,K2 < oo such that, for aU d > 1, 

E[/(Xo^,i;rrf; A;,)™] < j^^sj, (^^^^ f{x)dx 

,3/4 / /T, \ 

(A.45) <P(Xo^,iGi?> 

+ P«,^i?f' )x2exp(-^) 

<K,^k, +2exp^-^J. 
The corohary fohows from (A.45) since m >2 and kd>[d^]. □ 

We are now in position to define {F^^. For any < r < / and k G Z^, let 
(A.46) F|(r; A:) = {x^ : | Arf(x'^; r; k) - A(r)| < ^-^8}, 

where A(r) = /*r(l + r/2l). Let 

(A.47) F3 = |x'^: sup sup jAdCx'^; r; A;^) - A(r)| < d"^). 
K]<fcd<M'']o<''<' 

We study {F^{rd-,kd)} as a prelude to analyzing {F^^} where and kii are 
defined in Lemma A. 10 below. 

Lemma A. 10. For any sequence {r^} satisfying < < I, any sequence 
of positive integers {kd} satisfying [d^] < A;^ < [d^] and k> 0, 

(A.48) d^'FiX.^ i F^{rd, kd)) ^0 as d ^ oo. 

Proof. By the triangle inequality, 

d-ny^tiFi{rd;kd)) 

(A.49) <d'^P(|Arf(X^;rrf;A:rf)-E[Arf(X^;rd;A:rf)]| >d-Vl6) 

+ d'^nn^di^i^ Td-, kd)\ - A(r,)| > d~yi6) . 

In turn we show that the two terms on the right-hand side of (A.49) converge 
to as d — )■ oo . 
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By Markov's inequality, we have that for any m G N, 
d''P(|Ad(X^; rd-, kd) - E[Ad(X^; r^; kd)]\ > d~yw) 

' d N 



(A.50) 



\j=i 



IQm^K+my ^ . . . ^ E 



ll{qHXo,i/,rd;kd) 
-E[q''{Xo,,-rd;kd)]} 



Since the components of Xq are independent and identically distributed, we 
have for any {11,12, ■ ■ ■ , im}, there exists 1 < J <m and h,h, ■ ■ ■ ,h with 



■ m such that 



E 



(A.51) 



ll{q\Xo,i^ ; rd-, kd) - E[g'^(Xo,.^. ; r^; kd)]} 



j=i 



■-\{n{q''{Xo,m, kd) - n<f{Xox,rd; kd)]]'^]. 



Note that if any Ij = 1, then the right-hand side of (A.51) is equal to 0. 
By Corollary A. 9, if li,l2, ■ ■ ■ ,lj > 2, there exists Ki < 00 such that the 
right-hand side of (A.51) is less than or equal to Ylj=i{^i'^~^^~^^^^^^^} = 
K(d~'^ dr"^^/^ . Furthermore, there exists K2 < 00 such that for any 1 < J < m 
and li,l2, . . . ,lj >2, there are at most ^2^'^ configurations of {ii,i2, • • • , im} 
such that for j = 1,2, . . . ,J, Ij of the components are the same. Therefore 
there exists K <oo such that 



(A.52) 



il=l 



l[{q''iXo,^^ ; rd-, kd) - nqHXo,i, ; rd; kd)]} 



Taking m > k/(/3/8 — 7), it follows from (A.52) that the right-hand side 
of (A.50) converges to as d—)- 00. 

The lemma follows by showing that for all sufficiently large d, 

(A.53) |E[Arf(X^;rrf;fcrf)] - A(rd)| <d-Vl6. 

Note that 



E[Arf(X^; Td-, kd)] = d¥.[q\XoX.r; kd) 
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(A.54) 



/{x;rd;kd)f{x)dx 



+ d 



1 



q'^{x;rd;kd)f{x)dx 



q'^{x;rd;kd)fix)dx. 



By (A. 41), the second integral on the right-hand side of (A.54) is bounded 
above by dx 2exp(— y^/S/^) — as d — oo. Let f^, = supo<a.<i|/'(a;)|. Then 

by Taylor's theorem, for < x < k^^^/d, 



(A.55) 
Thus 

(A.56) 



\f{x)-fm<x sup ny)<hk'jVd. 

0<y<x 



\x;rd;kd)fix)dx- f{0)d 



q'^{x;rd;kd)dx 



u3/4 



<dxf. 



kr/d 



q'^{x; Td] kd) dx. 



Similarly, we have that 
'1 



(A.57) 



i-kl'^/d 



q'^{x;rd;kd)f{x)dx- f{l)d 



i-kT/d 



q {x;rd;kd)dx 



<dxf. 



u3/4 
"d 

d 



i~k'J^/d 



q'^{x; rd\ kd) dx. 



By symmetry, q'^{l - x;rd;kd) = q'^{x;rd;kd), so 



(A.58) d'' 



E[Ad(X^; Vd-, kd)] - 2rd / r^; kd) dx 



as d — >• oo. 



Since uid{y)dy > 1 — 2ad, using Lemma A. 8, (A. 34), we have that, for 
all sufficiently large d, 



d^ 



dj q'^{x;rd;kd)dx- d I q{x;rd]kd)—^^^^^ 

jo^d{y)dy 



dx 



(A.59) 



<Ad'+^ q\x;rd]kd)dx 
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d j {—^ — ^- 1 \q'^{x;rd;kd)dx 

. Jo My) dy 



< 4di+^a,d-2^ + d^+^ C ^^'^ r,; k,) dx. 

■JO Jo '^d(y) dy 

Let p'^{x]r(i]kd) be defined as in Lemma A. 8. Note that C/[0,1] is the 
stationary distribution of a reflected random walk on (0,1). Therefore for 
any > 1, 

(A.60) / p'^{x;rd-,k)dx= [ p'^(x; r^; 0) dx = 

JO Jo d 

Therefore, it follows from Lemma A. 8, (A. 36), that 

(A.61) d[ q'^{x;rd;kd)dx<d [ p'^{x;rd;kd)dx = 2rd. 

Jo Jo 

Hence the right-hand side of (A. 59) converges to as d— )• oo. 

Note that the stationary distribution of a single component of the pseudo- 
RWH algorithm has p.d.f. ujd{x)/ Jq ojd{y) dy (0 < x < 1). Therefore 

d [' q\x, Td-, kd)-p^dx = d C q\x- Td, 0)-p^dx 
Jo jQU}d{y)dy Jo Lujd{y)dy 

(A.62) 



21 J / \ 2d 

Finally, combining (A. 58), (A. 59) and (A.62), we have that (A. 53) holds and 
the lemma is proved. □ 

Lemma A. 11. For any k>^, 
(A.63) d^P(X[f i F|) ^0 asd^oo. 

Proof. Fix k > 0. Fix a sequence of positive integers {kd} such that 
< < [d^]- Fix e > 7 and let 5^ = {0, d'^ 2^"^ . . . , [ld^]d-^ , I]. Thus 
the elements of Sd are separated by a distance of at most d~^ . 

For any < r < Z and d > 1, there exist r^, E Sd such that Td < r < rd 
with rd — fd< d~^ . By the triangle inequality, 

|Ad(X^;r;fcd)-A(r)| 

< Ad(X^; r; kd) - Arf(X^; f^; kd) + | Ad(X^; f^; kd) - X{fd)\ 
+ A(r)-A(frf) 

< Xd(^t-,fd;kd) - Xd{^t-,fd;kd) + \Xd{^t-.rd;kd) - X{rd)\ 
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(A.64) 



<\Xd{X.i;fd;kd)-E[Xd{X.^;rd;kd)]\ 
+ 2| Ad(X^; kd) - E[Arf(X^; u; kd)] \ 
+ 2|A(fd)-A(fd)|. 
By Lemma A. 10, for any sequence {r^} satisfying < r^; < / 



(A.65) d'^+^+h 
Hence 

(A.66) d'^+^pf : 
For all sufficiently large d, 



\Xd{X^;rd;kd)-Xird)\ > 



max|Ad(Xo;r;/cd) - A(r)| > 







as d — )• oo. 



as d — >• oo. 



sup \X{rd) - X{sd)\ < — 

\<d-o 16 



(A.67) 

0<rd,Sd<l,\rd-Sd\ 

Therefore it follows from (A.64), (A.66) and (A.67) that 

(A.68) d^+^rf sup \Xd{X.^;r;kd)-X{r)\>d~A ^0 asd^oo. 

^0<r<l ' 

Since (A.68) holds for any sequence {kd\ satisfying \df^\ < kd < [d^], the 
lemma follows since 



(A.69) 



d^¥{ sup sup |Arf(Xg;r;A;)- A(r)| >d~^ 

^[dl^]<k<[d^] 0<r<l 

V P(sup \Xd{Xi,r;k)-X{r)\>d-A. 



□ 



Finally, we consider 



(A.70) 



The sets mirror the sets {-Fn} in [14] and are used when considering 

G^H{x^) and G^^^ H(x'^) but play no role in analyzing Pd- 

Lemma A. 12. For any K>d, 
(A.71) d^P(X[f ^ Fj) ^ asd^oo. 
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Proof. Let g* = supo<y<i |5''(2/)| and fix k > 0. Then by Hoeffding's 
inequality, 

/ d \ 



d^FiX.^ i Fi) = d"" 

(A.72) 



> 



< d'' X 2exp( — — — -7 —^0 asd— >oo. ,-, 
V d{g*)y □ 

Finally we are in position to consider {F^} and {F^}. Recall that, for 
d>l,F, = FinF|nF|nFiand 

Frf = jx'^; P U X^^ ^ i^dlX^ = x'^) <d-^^. 

Combining Lemmas A.l, A. 5, A. 11 and A. 12, we have the following the- 
orem. 

Theorem A. 13. For any k > 0, 
(A.73) d^'FiX.^ Fd) ^ asd^oo. 

Hence, by Lemma A. 3, for any k > 0, 
(A.74) d'"P(X;^ ^ Fd) ^ asd^oo. 

Also using the couplings outlined above, we have that 

(A.75) F\\J{WjiFd]\WieFdj^Q asd^oo. 



APPENDIX B: PROOF OF Pd\^o = X^ EXP(-LF*/2) 
We show that for any sequence {x'^} such that x'^ G F^, 
(B.l) Pd|X^ = x'^^exp(-//72) asd^oo. 

The key result is Lemma B.l which states that after kd iterations, the con- 
figuration of the components in the rejection region i?^ resemble the configu- 
ration of the points of a Poisson point process with rate A(r) = /*r(l + r/2/) 
on the interval [0, 1] . 

For any n G N and 1 < z < n, let 
d 

5^(x^; i; k) = J2{xf{xj;il/n; k) - x^xf, {i - l)l/n; k)] 
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with 

S^(x'^; k) = {Sii^^; 1; A:), ^^^(x'^; 2; fc), . . . , S^{^^; n; A;)). 

Let S„ = (S,^, 5^, . . . , S*") where the components of S„ are independent Pois- 
son random variables with ~ Po(A„^i) and 

A„^i = X{il/n) - X{{i - l)l/n) {l<i< n). 

Lemma B.l. For any n G N, any sequence of positive integers {k^} sat- 
isfying [d^] <kd< [d^] and x'^ G F^, 

S^(x'^; /crf) S„ asd^oo. 

Proof. Fix n G N and x*^ G Fd- Let 

S^(x'^;M = (S;?(x'^;l;M,S;?(x'^;2;A;,),...,S;^(x'^;n;A:,)), 

where for 1 <i <n, S!^{'K'^;i; k^) are independent Poisson random variables 
with means 

Xi^.i^"- kd) = A,(x'^; il/n- k^) - Arf(x'^; {i - l)//n; k^). 
The lemma is proved by showing that 

dTv{^i{^'';kd),Sn) < dTv{Si{^''-kd)X{^''-M) 
(B.2) +dTv{K{^''\kd),Sn) 

— )• as (i — )• oo. 

By [1], Theorem 1, 

d 

(B.3) dTv iKi^"-. kd)X{^^; kd)) < «''(^^; ^; ^^)'- 

i=l 

By Lemma A. 8, (A. 33) the right-hand side of (B.3) converges to as d — )• oo. 

For the second term on the right-hand side of (B.2), it suffices to show 
that 

S^ix"^; kd) Sn asd— J-oo. 

(For discrete random variables convergence in distribution and convergence 
in total variation distance are equivalent; see [2], page 254.) 

The components of S^(x'^;A;rf) and are independent, and therefore it 
is sufficient to show that, for all 1 < i < n, 

(B.4) Sn{yL'^;i\kd) Sn,i asd^oo. 

For ah 1 < f < n, (B.4) holds, if 

(B.5) Xf,^i{x'^;kd)-^ Xn,i asd^oo. 

Therefore the lemma follows from (B.5) since [d^]<kd<[d^] and x'^gF|. 
[See (2.17) for the construction of {-P'J}-] □ 
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Lemma B.l is the key result stating that if the pseudo-RWH process is 
started from the set F^, then after [d^] iterations the distribution of the com- 
ponents in the rejection region are approximately given by S„. We show that 
studying the pseudo-RWH algorithm over [d^] iterations suffices in analyzing 

Td{^) = ES'"'' MjM^^j))- Note that Pa satisfies 

(B.6) Td{Pd)<l<Td{Pd + l/[d^]). 

Let Trf(7r) = pjj X^j-^o~^' Before establishing a coupling be- 

tween Td{iT) and T(i(7r), we give a simple coupling for geometric random 
variables. 

Lemma B.2. Suppose that < q < p < 1 and that X and Y are inde- 
pendent geometric random variables with success probabilities p and q, re- 
spectively, that is, X ~ M[p) and Y ~ M[q). Let A be a Bernoulli random 
variable with W[A = 1) = q/p and Z ~ M{q). Then if A, X, Y and Z are 
mutually independent, 

(B.7) Y = X + A)Z. 

Therefore there exists a coupling of X and Y such that 

(B.8) p(X/y) = P(A = 0) ^"^ 



p 

Lemma B.3. For any < tt < 1 and x*^ E F^, there exists a coupling of 
T(i{tt) and Trf('/r) such that 

(B.9) P(rrf(7r)/frf(7r)|X^EEW^ = x'^)^0 asd^<x,. 

Proof. For x'^ e F^, by Corollary A. 7, we have that 

(B.IO) phj{Xj^/Wj"'}|X^ = W^ = xM ^0 asd^oo. 

Suppose that for j = 0, 1, . . . , [d^] , = G F] . Then using Lemma B.2, 



(B.8), Mj(Jrf(X^)) and Mj(f]d(Wf)) can be coupled such that 



"d(X^- j 

(B.ll) 

Since X^ G F], J^d(X^) > 2-"'^°^'^ > rf-^, the right-hand side of (B.ll) is less 
than d^{17rf(X^) - Jd(X^)}. Note that 
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so by Lemma A. 6 for any a < ^, d"^ times the right-hand side of (B.ll) 
converges to as d — )• oo. Taking a such that 5 + 7 < a < ^, 



(B.12) 



j=0 



as d — )• 00. 
The lemma then fohows from (B.IO) and (B.12). □ 

We show that it suffices to study fd(7r) = j^jj T,f=o~^^ Jld(W^)"^ In other 

words, replace the mean of the geometric random variables {M {^j_(Wq)) , 
M(f^d(Wf)), . . . ,M(r2rf(W|^^^4_^j))} by the mean of the means of the geo- 
metric random variables. 

Lemma B.4. For any < tt < 1 and for any sequence o/jx"^} such that 

G Fd, Trf(7r)|W^ = x'^ A 7rexp(/*//2) ^/^i(^)|W^ = x'^ A 7rexp(/*//2) 
as d — )• 00 . 

Proof. Let Ad = \jf2o{^j i Fd}. Then for any x"^ G Fd, P(Ad|W^ = 
x*^) — 7- as d — )• 00. 

For any r G M with i = \/— 1, the characteristic function of Td{iT) condi- 
tional upon and = x"^ is given by 

E[exp(zTfrf(7r))|^J,W^ = x'^] 

-[wd'^-l] 



(B.13) 



n ^ 



expf^M,(0,(Wf))) A^,{W'^} 



A^,Wo^ = x'^ 



TT — - 



exp{iT/[d^])Qd{Wf] 



^^,W^ = x'^ 



^.^0 . {l-ndiW'j))eM^r/m' 
Conditional upon A^ , Jld(W^)-i < 2t'°s^ < d^. Hence, for aU < j < [vrd^ - 1], 



(B.14) 



exp{iT/[d^])nd(W'j) 



IT 



l-(l-0,(Wf))exp(zT/[d^]) [rf^] 



1 



Thus E[exp(iTfd(7r))|A^,W^ = x'^] has the same limit as (i — t- 00 (should 
one exist) as 

(B.15) 



E 



; ^ — ^ 
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which in turn has the same hmit as d — )• oo as 

■[7rd'5-l] 



E 



(B.16) 



j=0 



= E[exp(irr,(7r))|A^,W^ = x'^]. 
The lemma follows since P(^^| Wq = x'^) — )• 1 as d — )• oo. □ 

We shall show that Td{'K) — ^ exp(//*/2) as (i — )• oo using Chebyshev's in- 
equality in Lemma B.9. We require preliminary results concerning 
cov(rjrf(W^)~-'^, 0(i(W^_,_^)~-^|WQ = x'^) with the key results given in Lem- 
ma B.8. First, however, we introduce useful upper and lower bounds for 
rirf(x'^)~^ which allow us to exploit Lemma B.l and prove uniform Integra- 
bility {fd{^)}. 

For n G N, 1 < i < n and x"' G (0, 1)'^, let lYi-x!^) = 6*7" (x'^) - 6j~^^'/"(x'^) 
with b^lx"^) = {h''/{-x!^)X/{-s!^), . . . ,6;j'"(x'^)). Forn G N and s = (si, ss, • • • , 
Sn) G M", let 



(B.17) 



(B.18) 



^n(s) = n 

i=i 



1 J - 1 
- + 

2 2n 



^>n(s) = nf^+ ^ 



2n 



Then for ah x'^ G (0, l)*^, 

(B.19) ^n(bS(x'^)) < ^di^")-^ < i>n(bS(x'^)) < 2^^*(-'). 

Lemma B.5. For any m G N, any sequence of {x'^} such that x*^ G Fd 
and any sequence of positive integers {kd} satisfying [d^\ ^ k(i< [d^], 



(B.20) E[(2 



:x'^]^exp((2"-l)A(/)) asd^oo. 



Proof. Note that {6|^(Wf J| = x'^} = Y,'j=i Xj{xj\h kd)- Then since 
the {xj{xj;l:,kd)} are independent Bernoulli random variables, 



(B.21) 



d 
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By Lemma A.8, (A.33), for x"^ G F^, Yfj=i<f{xi'^hkdf ^ as d ^ oo, so 
the right-hand side of (B.21) has the same limit as d— t- oo as 

d 

(B.22) W^MiT - = exp((2"^ - l)Ad(x'^; A:,)). 

i=i 

The lemma follows since for any x"^ G Fd, Arf(x'^; /; kd) — )• A(/) as d — > oo. □ 

Lemma B.6. Fix m,n G N. For any sequence {x'^} such that x*^ G Fd, 
and any sequence of positive integers {kd} satisfying [d^\ ^ kd < [d^], we 
have that 

E[m„(S^(x'^; kd)r] ^ E[m„(S„)'"] as d ^ oo, 
E[z>„(S:^(x'^;A:rf)n ^E[C(S„n as d^oo. 

Proof. By [6], Theorem 29.2, and Lemma B.l 

(B.23) i>„,(S^(x'^; kd)r ^ i^ni^nT as d ^ oo, 

(B.24) z>n(S^(x'^; kd)r ^ U^nT as d ^ oo. 

The lemma follows since (B.19) and Lemma B.5 ensure the uniform integra- 
bility of the left-hand sides of (B.23) and (B.24). □ 

Lemma B.7. For any sequence {x*^} such that x"^ G Fd and sequence of 
positive integers {kd} satisfying [d^\ <kd< [d^\, 

(B.25) E[Od(W^J-^|W;^ = x'^] ^exp(/*//2) asd^oo. 

For any x"^ G Fd and sequences of positive integers {id} and {kd} satisfying 
[d^]<kd<[d^] andid + kd<[d% 

(B.26) md{y^Uk,r^\^t^t=A^^Mn/'^) asd^^. 

Proof. An immediate consequence of Lemma B.6 is that 
hm nun{^i{^''\kd))l lim E[i>„(S^(x'^;fcd))] ^exp(r//2) as n ^ oo, 

from which (B.25) follows by (B.19). 

By Theorem A.13, (A.75), P(Wf^ G Fd\W'}^ G F^) ^ 1 as d ^ oo, so (B.26) 
follows from (B.25). □ 

Lemma B.8. For any sequence {x*^} such that x"^ G Fd and any se- 
quences of positive integers {id} and {kd} satisfying [d^] < id,kd < [d^], 

(B.27) cov(0,(Wfj-i,0,(Wf )-i|Wo^ = x'^)^0 asd^oo 
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and 



var(^],(Wt)-^|Wo^ = x" 



(B.28) 



exp(/*^{41og2- |}) -exp(/*0 asd^oo. 



Proof. Using (B.19), Lemma B.5 and Markov's inequality, it is straight- 
forward to show that for any S > 0, there exists K <oo such that 

(B.29) P(J^d(W^J-^ > K\W^ = x"^) < P(2''^(^^d) > K\W^ = x*^) < 6. 

Therefore it follows from Lemma B.7 that, for any sequence {x'^} such that 

(B.30) - E[0,(W^^+,J-^|W^ = ^']}\Wi = x'^ 

— ^ as (i — 7- oo. 

The uniform integrability of the left-hand side of (B.30) follows from (B.19) 
and Lemma B.5. Hence (B.27) follows. 

It is straightforward to show that E[m„(S„)^], E[i>„(S„)^] exp(/*/(41og 2- 
3/2)) as n — 7- oo. Therefore from (B.19) and Lemma B.5, we have that 

(B.31) E[Orf(W^J-2|w;^ = x'^]^exp(/*/(41og2-3/2)) asd^oo. 

Then (B.28) follows immediately. □ 

We are now in position to prove Lemma B.9, which is the final step 
in proving that for any sequence {x*^} such that x'^ G F^, P^IXq = x"^ 

exp(-/*//2) as d^oo. 

Lemma B.9. For any < tt < 1 and any sequence {x*^} such that x"^ G F^, 
(B.32) Td(7r)|W^ = x'^^7rexp(/*//2) as d^oo. 

Proof. Fix a sequence {x'^}. Let f}{7r) = prj Ejt^^ and 
let f|(7r) = E;='^7]" ^d"^^)-'. Thus Un) = Ti(vr) + f J(7r). 

Let Ad = Ylflii^j i Fd\- By Theorem A.13, (A.75), P(Ad|W;J = x'^) ^ 

as d — )• oo and conditional upon A^, Tj(7r) < ^^^jj-- Hence T^(7r)|WQ = 

x'^ as d— )• oo. 

By Lemma B.7, (B.25), 

E[T|(7r)|Wo^ = x'^] = -^ E[0,(W^^)"i|W^ = x'^] 

(B.33) 

^7rexp(r//2). 
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By Chebyshev's inequality, for any e > 0, 

P(|r|(7r)-E[rJ(7r)|Wo^ = x'^]|>e|W^ = x'^) 

(B.34) 

[nd^-l] [nd'~l] 

^ J j=[d0] l=[df] 

Since for all j, I, 

cov(17,(W^^)-\l7,(Wf)-i|Wo' = x") 

(B.35) 

< var(Q,(W^^)-i|Wo^ = x'^)i/2 var(Q,(Wf)-i|Wo^ = x'^)i/2^ 
it is straightforward to show, using Lemma B.8, that the right-hand side 
of (B.34) converges to as d ^ oo. Thus fJ(7r)|W^ = x*^ ^ 7rexp(/*V2) 
as d — )• oo and the lemma follows immediately. □ 

Theorem B.IO. For any sequence {x'^} such that x'^ G F^, 
(B.36) Prf|X[j = x'^^exp(-/*//2) as d^oo. 

Proof. For any < vr < 1, by Lemmas B.3, B.4 and B.9, 
(B.37) T,(7r)|X^ = x-^ 7rexp(r//2). 

Since Pd satisfies Td{Pd) < 1 < Td{Pd + l/[d^]), for any e>0, 
P(|Prf-exp(-rV2)|>e|X^ = x'^) 

(B.38) < F{Td{eM-rim " ^^/2) > 1|X^ = x'^) 

+ P(rd(exp(-/*//2) + e/2) < 1|X^ = x'^) 

for all sufficiently large d. The lemma follows, since (B.37) ensures that the 
right-hand side of (B.38) converges to as li —t- oo. □ 

APPENDIX C: PROOF OF (2.13) 

From Appendix B, we have that for any sequence {x'^}, such that x.'^ £ Fd, 

P^IXq = x'^ exp(— Z/*/2) as d — )• oo. Therefore we proceed by showing 
that, for any < vr < 1, 

(C.l) sup |G'J^//(x'^) -7rG'//(xi)| ^0 asd^oo, 

where //(x"^) = jfjE[(//(X|^^^,]) - F(X^))|X^ = x'^] is defined in (2.21) 
and 

(C.2) GH{x) = ^ll^y{x)H'{x) + lH"{x)y 

Equation (2.13) will then be proved using the triangle inequality. 
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We analyze GdH(X'j) = d'^E[H{'Xf - X.^)\X.^ = x'^], which is defined in 

(2.22), before using (2.21) to study GyH{^'^). However, first we require 
some definitions and preliminary results. Throughout we will utilize the 
following key facts noted in Section 2: H'{0) = if'(l) = and that Hl,H^ < 
oo, where ff^* = supo<y<i|i^'(?/)| and iJ| = supo<j^<i|i^"(y)|. 

We follow [7] and [10] in noting that, for any function h which is a twice 
differentiable function on R, the function z i— ?• 1 A e^^^^ is also twice differ- 
entiable, except at a countable number of points, with first derivative given 
Lebesgue almost everywhere by the function 



d 



1 A e'*^^) 



0, 



if h{z) < 0, 

dz [0, if/i(z)>0. 

The second derivative can similarly be obtained but will not be explicitly 
required for our calculations. 

For — 1 < 2; < 1, let JJ(x'^) denote the probability of accepting a move in 
the RWM algorithm given that Zi^i = z and let 



(C.3) 



exp ^^^"^^(a^j + (^dZi,j) - g{xj)}j 



X 1 



{E,t2(9(^j+adZi,,)-5(x,))<0} n l{0<Xj+<7dZi,,<l} 

i=2 



Then for all x'^, using Taylor's theorem, 

(C.4) Jli^^) = l{o<.,+.,.<i}{ J^^x-^) + a,g'{x,)zJ!{^'') + 0{aj)}. 

Therefore for xi G {ad, 1 — era), 

(C.5) J,(x'^) = j0(x'^) + O(a2). 



Lemma C.l. 
(C.6) 



sup 



J0(x'^) 



J0(x« 



as d 



00. 



Proof. Let ^%^'^) = E[n^, 

and let r2°(x"') = E[n^^2 l{o<x-j+t7dZi,,<i}], the probability a proposed move 
stays inside the unit cube given that the first component does not move. The 
proof of (A. 26) can be adapted to show that, for any a < ^, d°|r2j](x'^) — 
- Jd{^'^)\ ^0 as d^oo. Therefore since for x'' G Fd, 



jf(x'^),f]0(x'^) >exp(-/g*)(i-T, (A.3), we have that 
(C.7) sup Wr.i - ^0 asd- 



sup 



J0(x'^) 170(x'^ 



00. 
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Let i3d(x'^) = {2<j< d-Xj e R^J and let /^(x'^) = Ej^i3,(x<') Wl^^jO^i 
Since |;Srf(x'')| < ^logd, we have that 



•J- 



(C.8) 



< {-f log d)adg*. 



Yl iaixj + CTdZij) - g{xj)) 
Then using a Taylor series expansion, there exists K < oo such that, for all 

(C.9) /.(x'^) - ^ < J2{g{x, + aaZ,) - g{x,)) < + ^^^^'^ 



d 



j=2 



d 



Since Zi^i, ^1^2, • • • j are independent, and whether or not a proposed move 
from x'^ stays inside the hypercube depends only upon ;B'^(x'^), 

J70(x'^)P(/,(x'^)<-i^logd/d) 

(C.IO) 

< Q^diA < n^di^FildiA < Klogd/d). 
For all x-^ e F,, i ^ IE[5'(^i)'], so 

\/d/d(x'^) A Af(0,E[c/'(Xi)2]) asd^oo. 
Therefore it follows that 

(C.ll) sup -- ^0 asd-^00 



sup 



O0(x'^) 



with the lemma following from (C.7) and (C.ll) by the triangle inequality. 

□ 

Lemma C.2. For xi e {ad, 1 - <Td) and x"^ G Fd, 
(C.12) GdH{^'') = + + 



jO(x^)- 



where — )■ as d — )• 00 . 
For xi e i?^, 

(C.13) 



|G,if(x'^)|<|i/2*^'- 



Proof. For d > 1, fix x"^ G Fd and suppose that xi S (cj^, 1 — cJd). Then 

GdH{^'') = d^E[H{±i) - H{±^o)\±t = x'^] 
(C.14) 



d^ 



-E 



(fl(x^ + ..Z^)-g(x-))|lA -^'"7:f''> | 

I 7rd(x°') J 



The right-hand side of (C.14) is familiar in that it is the generator of 
the RWM-algorithm divided by the acceptance probability; see, for exam- 
ple, [14], page 113. 
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First, note that 

2 

H{xi + UdZi) - H{xi) = <JdZiH'{xi) + ^ZlH"{xi) 

+ ''lzl{H"{xr+^i)-H"{x,)}. 
Using (C.4), (C.5) and noting that < xi + (JdZi < 1, we have that 



X E 



\adZiH'{xi) + '^ZlH"{xi) 
+ dzl{H"{x, + i^i)-H"{x^)}^ 

X { JO(X^) + jO(x'^)W(rEi)Zi + 0(a2)}l^o<xi+<x,Zi<l} 

<jdnzi]H'{xi) 



J0(x'^) + O(a2) 

(C.15) + joff+ ^nz!{H"{x, + - H"ix,)}] 



The first term on the right-hand side of (C.15) is 0. Since ff| < oo, by the 

continuous mapping theorem, {H"{xi + ^pf) — H"{xi)} as d — )■ oo and 
then since Zi is bounded the third term on the right-hand side of (C.15) 
converges to as d—)- oo. For x'' G F^, J^(x'^) > e"'^ d~'^ , and so, the right- 
hand side of (C.15) equals 

where — ?• as d — t- 00. Thus (C.12) is proved. 

The proof of (C.13) follows straightforwardly using Taylor series expan- 
sions since H'{0) = H'{1) = 0. □ 
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Since g* = snpQ^y^i\g' {y)\, , H2 < 00, an immediate consequence of 
Lemma C.2 is that, there exists K* < 00 such that 

(C.16) sup sup \GdH{x.'^)\<K*. 

Lemma C.3. For any sequence of positive integers {kd} satisfying [d^^] < 
kd<[d^], 

(C.17) sup \E[GdH{±i J\±^ = ji.'^]-GH {xi)\^0 asd^oo. 

Proof. Fix {kd} and note that 
E[GdH{Xl)\±t = ^''] 

(C.18) = Fi±l E Fd\±i = ^')E[GdHi±l)\±t = ±1 e Fd] 

+ ¥{±1 i Fd\±t = x'^)E[G',/?(Xf J|X^ = x^Xf^ ^ Fd]. 

Since H eV, = supo<y<i \H{y)\ < 00. Therefore, for aU y'^ G [O,!]'^, 
GdH{y'')<2d^H*^. By (2.14), sup^.^^^ d^p^x^^ ^ ^^jx^ = x'^) ^ as d^ 
00. Thus the latter term on the right-hand side of (C.18) converges to as 
d — >• 00. 
Now 

nGdH{±l)\±t = ^'',±t^Fd] 

=nxti^Rd\^t=^''M,^Fd) 

(C.19) X E[G,//(Xf J|X^ = x^Xf^ G Fd.Xl^, i R^,] 

+nxti^R'd\y4=^''Xt^Fd) 

Consider first the latter term on the right-hand side of (C.19). By Lemma C.2, 
(C.13), 

(C.20) nGdH{±i;)\±t = x^X^^ e Fd,Xti e R'd] < ¥^Hl 
Note that 



(C.21) 



< 



P(X^^GF,|X^=x'^) 

n^Li£-Rd|Xg = x'^) 
P(Xf^GF,|X^ = x'^) • 



By (2.14), for x'^ G Fd, P(Xf^ G Fd\±^ = x"^) ^ 1 as d ^ 00. Use Corol- 
lary A. 7 and Lemma A. 8 to show that ¥{X^^ ^ G Rd\^o = x"^) ^ as d ^ 00. 
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Hence, the right-hand side of (C.21) converges to as d — )• oo and conse- 
quently the latter term on the right-hand side of (C.19) converges to as 
d — oo. 

It follows from the above arguments that 
(C.22) min P(Xf^,i ^ i?^, Xf^ G F,|X^ = x-^) ^ 1 as^i^oo. 

Also it follows from (C.16) that there exists K <oo such that 
(C.23) sup sup nG,H{±l)\±t = ^'',XlG F,,Xl^,^ R',]< K. 

Therefore, it is straightforward using (C.18), (C.19) and the triangle inequal- 
ity to show that 

sup \nGdH{±l)\±t = A 

(C.24) -nGdH{±i^)\±i = ^^±t^Fa,Xl^,iR',]\ 

— )• as d — )• oo. 
By Lemma C.2, (C.12), there exists — t- as d — t- oo, such that 

sup nCdHiM^ - GH{Xl.)\±t = x^Xt G Fd.Xl, i R',]\ 



I' 



(C.25) 



< V sup \g'{y)H'{y)\ 

O 0<y<l 



X sup E 
< ^^g*Hl sup 



J Ay') 1 



Xg - x'', X^^ G Fd, X^^ i ^ i?^ 



+ 4 



+4- 



By Lemma C.l, the right-hand side of (C.25) converges to as oo. 
Using the triangle inequality, the lemma follows by showing that 

sup \nGH{Xl^,)\±^, = x^Xf^ G F,,Xl^, i R\\ - GB(xr)\ 

x<^GFd 

(C.26) 

— )• as d — oo. 

Note that x — x\\< kdcra, and so, (C.26) follows since GH[-) is contin- 
uous. □ 

We are in position to prove (C.l). 

Lemma C.4. For any < vr < 1, 



(C.27) 



sup |gJ^(x'^) -7rGi7(xi)| ^0 as d^oo. 

x<*GFd 
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Proof. Since (C.27) trivially holds for vr = 0, we assume that vr > 0. 
For all sufficiently large d, by the triangle inequality, 



(C.28) 



J j=0 



< 



[7rd*-l] 



+ 7^ E \nGdH{±'^)\Xt = A-GHix,) 



+ vr 



GH{xi). 



Since 



nGdH{±j)\±t=A 

(C.29) =nGdH{±'j)\±t = ^\±i e Frf]P(X^"' G Fd|X^ = x'^) 

it is straightforward, following a similar argument to the proof of Lemma C.3, 
(C.23), to show that there exists K < co such that, for all < j < [d^], 



(C.30) 



sup |E[Qi/(X^^)|X^ = x'^]|<i?. 



Therefore the first term on the right-hand side of (C.29) is bounded by 
By Lemma C.3 the supremum over x*^ S of the second term 
on the right-hand side of (C.28) converges to as d — >• oo and the lemma 
follows. □ 



Corollary C.5. 
(C.31) sup sup IG^/ix."^) -ttGH{xi)\^0 asd^oo. 



0<1T<l^d 



Proof. Fix e > and let = {0,e, 2e, . . . , [l/e]e, 1}. It follows from 
Lemma C.4 that, for all sufficiently large d, 



(C.32) 



max sup |gJ^(x'^) -7rGF(xi)| <e. 
Ten, ^d^^^ 
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Consider any < vr < 1. There exists vf G such that vr < vr < vf + e. By 
the triangle inequaUty, 

\G'fH{^'')-7TGH{x,)\ 

(C.33) < \G^fH{^'^) - GfH{^'^)\ + IcfHi^"^) - nGH{xi)\ 

+ (7r-fr)|Gi/(xi)|. 
Again by the triangle inequality, 



sup |G^'"if(x'^)-Gj"i/(x° 



(C.34) 



[7rd*-l] 



< 



Since for all sufficiently large d, {[ird^ - 1] - [itd^])/[d^] < 2e, it follows 
from (C.30) that the right-hand side of (C.34) is bounded by 2K£, where K 
is defined in Lemma C.4. 

Let K = 2K + 1 + supo<y<i \GH{y)\. Note that since 5*, ifj* , < 00, we 

have that K < 00. Therefore it follows from (C.33) that for all sufficiently 
large d, 

(C.35) sup_ \G^/H{x'^) - 7rGH{xi)\ < Ke. 

Since (C.35) holds for all < vr < 1 and e > 0, the lemma follows. □ 

Finally we are in position to prove (2.13), and hence complete the proof 
of Theorem 2.1. 



Lemma C.6. 
(C.36) sup |G^i?(x'^) -Gi/(xi)| ^0 



as d^ 00. 



Proof. Note that G^i^(x°') is given by (2.8) and GH{xi) = exp{-lf*/2) x 
GH{xi). Therefore by the triangle inequality, 

sup \G^aH{x'^)-GH{xi)\ 
d^ 



= sup 
x^eFd 

< sup 

x'^G-Fd 



E 



E[i7(Xf^^,,,) - i?(Xg)|Xg = x'^] -exp(-Z/72)Gi7(xi) 



— (HiXfp^^s^) - if(X^)) - P,Gif(xi)|X^ = x'^ 
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+ sup \E[PdGHixi)\±^o = x-^] - exp{-ir/2)GH{xi)\ 

(C.37) 

|Jj(i?(X|t,.j) - H{±',)) - 7rG//(xi)|X^ = x'^ 



< sup sup 

0<7r<lxdg_F^ 



E 



+ sup |E[Prf|X^ = x'^]-expH/72)| sup 
x^eFd 0<?/<i 

< sup sup \G^/H{x'^)-7rGH{xi)\ 

0<7r<lxdeF^ 

+ sup |E[Pd|X^ = x'^]-exp(-//72)| sup \GH{y)\. 

By Corollary C.5, the first term on the right-hand side of (C.37) converges 
to as d —7- oo. By Theorem B.IO, for any sequence {x*^} such that x"^ G F^, 

P^^IXq = x"^ — ^ exp(— //*/2) as d — 7- oo. Hence the latter term on the right- 
hand side of (C.37) converges to as d—)- oo, since g*,Hl,H2 < oo implies 
that supo<y<i \GH{y)\ < oo. □ 

Acknowledgments. We thank the anonymous referees for their helpful 
comments which have improved the presentation of the paper. 
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